A Multi-Strip Algorithm and Its Application to Gene Characterization Using DNA-Array Data
Abstract
A fast adaptive multiscale algorithm has been devised to characterize a random set of points that spans a high-dimensional Euclidean space but is concentrated around special lower-dimensional subsets. It has been adapted to analyze gene expression data from microarray experiments. We present here the simplest version of this “multi-strip” algorithm, applied to a set of points in R concentrated around a line. The algorithm characterizes this set by finding a strip around the principal axis of the set, thereby isolating deviating points from the main bulk of points enveloped by the strip. The algorithm generalizes to computing a strip around a best L d-plane, where 1 ≤ d < D, or even to fitting a strip around a d-dimensional Lipschitz graph. We establish various estimates for its performance. When applied to gene-expression data, the algorithm can be thought of as estimating the local statistics (means, standard deviations, tail distributions, etc.) as a function of the entire expression range. Genes with abnormal differential expression values can be identified and given biological interpretations based on the local deviations in their statistics. By avoiding rigid local segmentations (as in segmental nearest neighbor normalization) and nonadaptive global estimates, the algorithm achieves superior performance.
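The idea of a strip around the principal axis can be illustrated with a minimal sketch. This is not the multi-strip algorithm itself: it fits a single global strip to points assumed to lie in the plane, and the function name `fit_strip`, the width multiplier `k`, and the use of the median absolute deviation as the scale estimate are all assumptions made for illustration.

```python
import math
from statistics import median


def fit_strip(points, k=3.0):
    """Fit one strip around the principal axis of 2-D points.

    Hypothetical helper for illustration only; the strip half-width is
    k times a robust scale estimate of the perpendicular offsets.
    Returns (center, direction, half_width, outlier_flags).
    """
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n

    # Entries of the 2x2 sample covariance matrix.
    sxx = sum((x - cx) ** 2 for x, _ in points) / n
    syy = sum((y - cy) ** 2 for _, y in points) / n
    sxy = sum((x - cx) * (y - cy) for x, y in points) / n

    # Angle of the principal (major) axis of a 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    ux, uy = math.cos(theta), math.sin(theta)

    # Signed perpendicular offset of each point from the principal axis.
    offsets = [-(x - cx) * uy + (y - cy) * ux for x, y in points]

    # Robust scale: median absolute deviation, rescaled so that it matches
    # the standard deviation for normally distributed offsets.
    mid = median(offsets)
    mad = median(abs(o - mid) for o in offsets)
    half_width = k * 1.4826 * mad

    # Points outside the strip are flagged as deviating.
    outliers = [abs(o - mid) > half_width for o in offsets]
    return (cx, cy), (ux, uy), half_width, outliers


if __name__ == "__main__":
    # Points near the line y = 2x, plus one strongly deviating point.
    data = [(0.1 * i, 0.2 * i + 0.05 * math.sin(i)) for i in range(200)]
    data.append((10.0, 30.0))
    center, direction, half_width, flags = fit_strip(data)
    print("direction:", direction)
    print("half-width:", half_width)
    print("flagged points:", [p for p, f in zip(data, flags) if f])
```

A single global width like this is exactly the kind of nonadaptive estimate the abstract contrasts with; an adaptive variant would let the strip width vary along the principal axis.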
Similar resources
A MULTI-OBJECTIVE EVOLUTIONARY ALGORITHM USING DECOMPOSITION (MOEA/D) AND ITS APPLICATION IN MULTIPURPOSE MULTI-RESERVOIR OPERATIONS
This paper presents a Multi-Objective Evolutionary Algorithm based on Decomposition (MOEA/D) for the optimal operation of a complex multipurpose and multi-reservoir system. Firstly, MOEA/D decomposes a multi-objective optimization problem into a number of scalar optimization sub-problems and optimizes them simultaneously. It uses information of its several neighboring sub-problems for optimizin...
Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring
In data mining, clustering is an important technique for separating and grouping unlabeled data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as the Genetic algorithm, the PSO algorithm, the Artificial Bee Colony algorithm, the Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...
A full ranking method using integrated DEA models and its application to modify GA for finding Pareto optimal solution of MOP problem
This paper uses integrated Data Envelopment Analysis (DEA) models to rank all extreme and non-extreme efficient Decision Making Units (DMUs) and then applies integrated DEA ranking method as a criterion to modify Genetic Algorithm (GA) for finding Pareto optimal solutions of a Multi Objective Programming (MOP) problem. The researchers have used ranking method as a shortcut way to modify GA to d...
Circularly Polarized Circular Slot Antenna Array Using Sequentially Rotated Feed Network
This paper presents the design, simulation, and measurement of two low-cost broadband circularly polarized (CP) printed antennas: a single element and an array at C band. The proposed single element antenna is excited by an L-shaped strip with a tapered end, located along the circular-slot diagonal line in the back plane. From the array experimental results, the 3 dB axial ratio bandwidth can r...
Mitochondrial DNA characterization of Sergentomyia sintoni populations and finding mammalian Leishmania infections in this sandfly by using ITS-rDNA gene
Sergentomyia sintoni is the natural vector of Sauroleishmania species of lizards. This sandfly is abundant in and around the burrows of great gerbils. S. sintoni was collected from peridomestic animal shelters, inside and around houses, and also from the nearby burrows of the gerbil reservoir hosts, Rhombomys opimus, in several provinces of Iran. Mitochondrial Cytochrome b (Cyt b) of sandflies, wh...
Publication date: 2003